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USE OF PROBABILITY PAPERS
0.

FOREWORD
b) Estimation tion, and c) Estimation of percentiles of distribution of the distribuparameter(s).

0.1 This Indian Standard was adopted by the Bureau of Indian Standards on 23 February 1988, after the draft finalized by the Quality Control and Industrial Statistics Sectional Committee had been approved by the Executive Committee. 0.2 Some commonly used statistical models have been discussed in IS : 9300 ( Part 1 )-1979* and IS : 9300 ( Part 2 )-1979t. However, the selection of a suitable model which represents the given data is also of major concern. For this purpose, two different approaches may be considered, namely, probability plotting method and statistical tests. Probability plotting is a subjective method as the determination of whether or not the data contradicts the assumed model is based on visual examination rather than statistical However, the probability plottin,: calculation. method is very simple and may provide a great deal of useful information in addition to evaluation of appropriateness of the chosen model. 0.3 The technique of probability plotting provides a pictorial representation of data as well as the following: a) Evaluation of reasonableness of the assumed statistical model.
*Statistical models Discrete models. tStatistica1 models Continuous models. for industrial for industrial applications: applications: Part Part 1 2

0.4 In this technique of probability plotting, special probability graph papers have been designed for the distributions. The data is plotted on probability graph paper of an assumed distribution, The operational procedure for plotting has been given in 4 of this standard. If the assumed model is correct, plotted points on the graph paper will tend to fall in a straight line. If the assumed model is not appropriate, the plot will not be linear; and the extent and type of dispersion may be examined. Some guidelines for judging the extent of departure have also been discussed. 0.5 Once priate for model is plot. The has been the chosen model appears to be approthe given data, the parameter(s) of the ( are ) estimated from the probability procedure for estimation of parameter(s) illustrated with examples.

0.6 In reporting the result of a test or analysis, if the final value, observed or calculated, is to be rounded off, it shall be done in accordance with IS : 2-1960*.
*Rules for rounding off numerical values ( revise-d).

1. SCOPE 1.1 This standard covers the probability papers for normal, log-normal, exponential, Weibull, and binomial distributions. The procedure for estimating parameter(s) of the distribution has also been discussed. Each probability paper along with estimation of the parameter(s) has been illustrated with examples. Limitations on the use of probability papers for parameter(s) estimation has also been discussed.

2.2 Log-Normal Distribution - The probability distribution of a continuous random variable X, the probability density function of which is: f(x) = x0>% exp [-(lo~c$-B)`],

O<x<oo 2.3 Exponential Distribution - The probability distribution of a continuous random variable X, the probability density function of which is: f(x) = hexp ( - Ax), x > 0, and A > 0

2. TERMINOLOGY 2.1 Normal Distribution The probability distribution of a continuous random variable X, the probability density function of which is: f(x) = & exp

2.4 Weibull Distribution - The probability distribution of a continuous random variable X, the probability density function of which is:

---<xx++

1
1

f(x) =%( +>a-1

exp \ -

( x/b )a {, x > 0 and determine respectively, B

a and b are positive constants shape and scale-of the distribution

IS t 12348 - 1988 2.5 Binomial Distribution - The probability distribution of a discrete random variable X, such that:
P(X=x) =

x = 0, 1, 2,

," 0

P" ( 1 - P F",
. . . . . .. . n

O<PCl where 0 ?I Z = ?Z! x!(n--x)! ON USE OF PAPERS FOR ESTIMATION

3. LIMITATIONS PROBABILITY PARAMETER

4.1.4 For censored data, n represents the total number of samples, irrespective of whether or not all their values are known. The axis of the graph paper on which the.xi's are plotted will be referred to as the observational scale, and the axis for ( 2i - 1 ) 50/n as the cumulative scale. Whether the observational or cumulative scale appears on the abscissa differs for the various probability papers. However, the cumulative scale is easily recognized by the fact that its values range from a small positive number, such as 0.01 percent ( or 0,000 1 ) to some value close to 100 percent ( or 1-O ), such as 99.99 percent ( or 0.999 9 ). 4.1.5 If a straight line appears to fit the data, draw such a line on the graph paper. 4.1.6 If the chosen model is correct, the points should cluster around the line although there will be some deviations because of random-sampling fluctuations. If the plot deviates appreciably from linearity, the chosen model does not appropriately describe the given data. Systematic departures are indications that the model is not The determination of what may be appropriate. considered a straight line is a subjective matter, and two people looking at the same plot may arrive at different conclusions. However, the larger the number of samples and the greater the divergence from the assumed model, the easier it will be to detect a true departure. 5. GUIDELINES FOR JUDGING THE APPROPRIATENESS OF A PLOT 5.1 The following are the general guidelines given for judging the appropriateness of the assumed model and systematic departures from a straight line. 5.1.1 The dispersion of the plotted points in the tails ( extreme low or extreme high ) will be larger than that of the points at the centre of the distribution. Thus, the relative linearity of the plot near the tails of the distribution will often seem poorer than at the centre of the distribution even if the correct model has been chosen. 5.1.2 The plotted points are ordered and hence are not independent. Thus, one should not expect them to be randomly scattered about a line. For example, the points immediately following a point above the line are also likely to be above the line. Hence, even if the chosen model is correct, the plot may consist of a series of successive points ( known as runs ) above and below the line. 5.1.3 A model can never be proven to be appropriate on the basis of sample data. Thus the probability plot of a small sample taken from a near-normal distribution will frequently not differ appreciably from that of a sample from a normal distribution.

3.1 The use of probability papers for estimation of parameter is essentially based on the estimation of population quantiles ( or percentiles ) by the corresponding sample quantiles. For estimating the two parameters in either the normal or the log-normal or the Weibull distribution, one requires to estimate two quantiles using the probability plots. The choice of these two quantiles is not unique unless determined through the complicated approach to minimize the generalized variance of the estimators over the possible choices. Thus one may use different pairs of quantiles to get different pairs of estimates from the same data for the two parameters in any of these three models. Further, the estimators obtained by this graphical method are not quite efficient compared to the maximum likelihood estimators. These two points should be kept in mind while using the technique of probability plotting. The principal merit of this technique is its simplicity. 4. OPERATIONAL PROBABILlTY PROCEDURE FOR A PLOT FROM THE DATA a probability from a given

4.1 The various steps in preparing plot for a continuous distribution data are as follows.

4.1.1 Obtain a probability paper, designed for the distribution under examination. 4.1.2 Rank the observations in non-decreasing . . . . . . . . . . . . . x (tl) are order. Thus, if x (l), x (2), the original unordered observations. then the ordered observations, to be denoted as xl, xs, _..... . . . .. . . . . , xn, are such that x1 < xs < , . . . . . . . . . . . . . . . < xn.
"

4.1.3 Plot the xI)s on the probability paper versus ( 2i- 1 ) 50/n, or ( 2i - 1 )/2n depending on whether the marked axis on the probability paper refers to the percent or the proportion of observations. It is henceforth, assumed that the probability paper requires the plot of percent observations. Thus the lowest observation is plotted versus 50/n; the second lowest versus 150/n, and so on.

IS:12348 5.1.4 An examination of a plot when there is a systematic departure from a straight line can give' some information about the shape of a more appropriate model. For example, if the plotted lower points are too large and the upper points are too small, compared with those expected from the chosen model, it indicates a distribution which is less peaked than a normal distribution. 5.1.5 If the values the data suggests that should be chosen. If the normal probability bution skewed to the 6. NORMAL in both tails are too large, a model skewed to the right the values in both tails of plot are toosmall, a distrileft would be suggested. PAPER paper is given in estimation of parais given in 6.2.
TABLE 2 VALUES OF xi's AND pi's

- 1988

TABLE

1

INITIAL

SETTING ( Clause 6.3 )

TIME

( MINUTES

)

35 36 38 40 41 42 43 43

45 46 47 47 48 49 49 50

50 50 51 51 52 52 53 53

54 54 55 55 56 57 57 58

59 60 61 63 65 67 69 70

PROBABILITY

6.1 The nbrmal probability Fig. 1. The procedure for meters of normal distribution

( Clous~ 6.3.1 ) Pi (1, 1 2 3 4 5 6 7 8 9 10 11 12 13 14 35 36 38 40 41 42 43 43 45 46 47 47 48 49 1'25 3`75 6.25 8'75 11'25 13.75 16.25 18'75 21'25 23'75 26'25 28'75 31'25 33'75 15 16 17 18 19 20 21 22 23 24 25 26 27 28 ;5) 49 50 50 50 51 51 52 52 53 53 54 54 55 55 i6) 36.25 38.75 41'25 43.75 46'25 48'75 51.25 53'75 56.25 58.75 61'25 63.75 66'25 68'75

6.2 Estimation of Parameters - For normal distribution, mean and median are equal. Therefore, from the probability plot, the 50th percentile or median is estimated as follows:

(5,("8', (9)
29 30 31 32 33 34 35 36 37 38 39 40 56 57 57 58 59 60 61 63 65 67 69 70 71'25 73.75 76.25 78'75 81.25 83.75 86'25 88'75 91.25 93.75 96.25 98'75

Pi

4 b) c)

Locate bution

50 percent scale,

on the cumulative

distri-

Find the intersection from `the plot and and

of the line obtained the 50 percent line, value on the

Read the corresponding observational scale.

6.2.1 In the similar way, the estimate of population standard deviation ( u ) is obtained by using the fact that for any normal distribution, the standard deviation is approximately equal to two-fifth of the difference between the 90th and the 10th percentiles. 6.3 Example - Table 1 gives 40 test results of initial setting time ( in minutes ) of ordinaryportland cement in non-decreasing order. Test for the normality and obtain the parameters of normal distribution. 6.3.1 For each i, the value ofpi = ( 50/n is calculated ( see Table 2 ). The points (pi, x1 ) are plotted on the normal lity paper ( >ee Fig. 1 ). It may be seen plot that the data seem to follow normal tion. 2i - 1 ) pairs of probabifrom this distribu-

7. LOG-NORMAL

PROBABILITY

PAPER

7.1 The probability paper for the log-normal distribution differs from that for the normal distribution in the sense that the axis for the observational scale is logarithmic instead of arithmetic ( see Fig. 2 ). A log-normal probability plot can also be prepared on normal probability paper by using the logarithms of the observations and plotting these against ( 2i - 1 ) 50/n. 7.2 Estimation of Parameters - The procedure for estimating the parameters is similar to that for the normal distribution, except that the logarithms of the observed values are used. Thus for log-normal distribution, mean ( p ) is estimated as the logarithm of the plotted 50th percentile, and standard deviation ( u ) is estimated as twofifths of the difference between the logarithms of the plotted 90th and 10th percentiles. The lognormal probability paper is illustrated with the help of example given in 7.3.

6.3.2 For obtaining the parameters, the values of x1 corresponding to 50th percentile (see Fig. 1 ) is 51.5, which is the mean of the normal distribution. For estimation of standard deviation, the 90th and 10th percentiles are corresponding to the values 62 5 and 40.5 on the observational scale. Thus, i = 2 ( 62.5 40.5 )/5 = 8.8 minutes.
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TABLE 3 DISTANCE IN MILLION TO FAILURE ( Clam 7.3 ) 22'5 37'5 46'0 48.5 51'5 53'0 54'5 57.5 66'5 68.0 69'5 76'5 77'0 78.5 80'0 81'5 82'0 83'0 84'0 91'5 93'5 102'5 107'0 109'5 112'5 113'5 116'0 117'0 118'5 119'0 120'0 122'5 123'0 127'5 131'0 132'5 134'0 METRES

7.3 Example - Anew control device was tested on 96 diesel locomotives. Whenever a device faiied, the distance in kilometres was recorded and the device was returned to the factory for failure analysis. After 135 000 km of service each unfailed device was removed from its locomotive. It was felt that the device could be warranted for 100 000 km. The underlying failure mechanism suggested a log-normal distribution for time to failure. The ordered times for the 37 devices that failed are given in Table 3. Test whether the data follows log-normal distribution. Obtain its parameters and also the probability that the device will not fail in the warranted period. 7.3.1 The data is censored as the values for the remaining 59 devices are known only to be above 135 000 km. But the value of n, the sample size, in the probability plot has to be taken as 96 ( see 4.1.4 ). The values of q's with corresponding probabilities ( pi ) are given in Table 4. 7.3.2 The pair of points (pi, x1 ) are plotted on the log-normal probability paper ( see Fig. 2 ) which suggests that the log-normal distribution is a reasonable choice since the plotted points fall close to the line drawn.
TABLE 4 VALUES

7.3.3

From Fig. 2,

the

50th

percentile

is for

x1 = 1 65 000. Therefore, estimate of mean ( I: ) = log xt = log ( 165 000 ) = 12.01. Similarly the values of xi for 90th and 10th ' percentiles are 400 000 and 66 000, respectively. Therefore, :==, ;2{ log, ( 400 000 ) loge ( 66 000 ) }/5

OF xi's AND pi's

( Clause 7.3.1 ) pi ( 21 -9; (3) 0.5 1'6 2'6 3'6 4.7 5'7 6'8 7'8 8'9 9'9 10'9 12'0 13'0 14'1 15'1 16'1 17'2 18'2 19'3 ,' ) 50 (4) 20 21 22 23 24 ( 21 1 ) 50 96

i

xi

Xl (5) 91'5 93.5 102'5 107'0 108'5 112'5 113'5 116'0 117'0 118'5 119'0 120'0 122'5 123'0 127'5 131'0 132'5 134'0

pi =

(1)
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 IF 17 18 19

(2)
22'5 37'5 46'0 48'5 51'5 53'0 54'5 57'5 66'5 68'0 69'5 7G'5 77'0 78'5 80'0 81'5 82'0 83'0 84'0

(6)
20'3 21'3 22'4 23'4 24'5 25'6 26'6 27.6 28'6 29'7 30'7 31'8 32'8 33'9 34'9 36'0 37'0 38'0

/ 25
26 27 28 29 30 31 32 33 34

T

35 36 37

6

`I

IS t 12348- 1988 7.3.4 From Fig. 2, the value on cumulative scale corresponding to x1 = 1CO 000 km is 24 percent which gives the probability of failure of device in less than 100 000 km. Thus, the estimated reliability ( probability of no failure ) in the first 100 000 km is 76 percent.
8. PROBABILITY represent the data reasonably well. The values of the censored observations may be estimated by determining values on plotted line corresponding to(2i - 1)50/n for i= 1,2,3 and4, thatis, the values on the observational scale corresponding to the values 3.1, 9 4, 15 6 and 21.9 on the cumulative scale. The resulting estimates are 0.000 5, 0.002 8, 0.004 5 and 0.006 5 percent. 8.3.3 As mentioned in 8.2, the exponential distribution is estimated i = ( 3.60 - 2'16 ) _ 2 ( 0.05 - 0.03 ) parameter as: 36.0 of

EXPONENTIAL

PAPER FOR DISTRIBUTION

8.1 The probability

paper for exponential distribution is shown in Fig 3. This paper is further illustrated with the example given in 8.3. 8;2 Estimation of Parameter - The parameter of exponential distribution ( A ) is estimated by the following relation: ;; -_ zr 2 (Y* 21
Yl >

9. WEIBULL

PROBABILITY

PAPER

where ~1 and ya are two values on the observational scale and 21 and zs are the corresponding values from the line at the top of the probability paper.

probability paper ( see Fig. 4 ) and estimation of its parameters are explained with the help of an example given in 9.2. 9.2 Example - The number of cycles to failure in a life test of 16 carbide inserts are given in Table 6. Test if the data follows Weibull distribution and obtain its parameters.
TABLE 6 FAILURE TIME FOR CARBIDE INSERTS

9.1 The use of Weibull

8.3 Example -

In a chemical analysis, a given contaminant can be measured analytically for amounts of 0 01 percent or more. A sample is chosen from each of the 16 lots and the percent contaminant is found to be more than 0.01 percent for 12 of the 16 analysis. Thus, it is a censored sample with the four smallest observations missing. The data after ordering are shown in Table 5. Determine whether the data fits to the exponential distribution and estimate its parameter.

( Clousc9.2 ) 210 271 396 402 619 698 811 900 904 1 008 1 130 1 230 1 256 1 417 1 500

1 621

8.3.1 The values ofpj = ( 21 - 1 ) 50/n have been calculated and given in Table 5. Since this is a censored sample with the four smallest observations missing, the first point offlr corresponds to the fifth ordered sample and is plotted versus 1 ) 50/n, the next value is plotted against (,E- 1 ) 50/n. 8.3.2 straight The closeness of the plotted line indicates that the model points to a appears to

9.2.1 The values of &r's have been calculated in Table 7. These values of pi's when drawn against q's on Weibull probability paper (see Fig. 4 ) indicate a straight line, thereby implying that Weibull distribution seems to be reasonable representation of the data.
9.2.2 To obtain the parameters of Weibull distribution, first the slope and intercept of the line are obtained as follows: Slope 5= m =

w,& -

WI Cl

TABLE

5

PERCENT

CONTAMINANT ( Clause 8.3

IN CHEMICAL

COMPOUND

)
Pl

Pi
t) 1 to 4 5 6 7 8 9 10

(2)
less than 0'010 0 28' 1 34.4 40'6 46'9 53'1 59'4 0'012 0 0'012 8 0.013 9 0'017 1 0'023 1 0'025 0

;4i 11 12 13 14 15 16

(5) 65'6 71'9 78'1 84'4 90.6 96'9

0'030 5

;Q,

0'036 3 0'040 2 0'050 3 0'068 9 0'096 8

7

*

IS : 12348 - 1988 where W, and WI are two values to be read from the right ordinate, and & and & are the corresponding values ( to W, and WI ) to be read from the top abscissa. Intercept = c = the value of W to be read from the right ordinate where the value of < ( to be read from top abscissa ) is zero. The parameters of Weibull distribution, namely, a ( shape parameter ) and b ( scale parameter ) are then obtained by the following relation: A a=m

b^= exp ( -c/m
TABLE i (1) 1 2 3 4 5 6 7 8 xi (2) 210 271 396 402 619 698 811 900 7 VALUES

)

OF xl's AND pi's i xi (5) 904 1 008 1 130 1 230 1 256 1417 1 500 1 621 Pi (6) 53'1 59'4 65'6 71'9 78'1 84'4 90'6 96'9

( Claurr 9.2.1 ) pt3(2i-1) x (3) 3'1 9'4 15'6 21'9 28'1 34'4 40'6 "?t% . 50/16 (4) 9 10 11 12 13 14 15 16

00
0

1

L

9

16

25
x-

36

43

6L

81

100

Fm. 5

QUARTER CIRCLE

for W, = 2-O and 9.2.3 In this example; W, = -8.0, the corresponding yalues of & and <i are 7.98 and 2-88. Therefore, Slope = m E c 20-(-8'o) 7.98 13.5. = 980. PAPER 2.88 ' -1.96 and

( x,y ) with the origin. Extrapolate it, and read the scale of vertical axis at the intersection of the line and the quarter circle. This is the point estimate of percent defective. Draw two lines around the point ( x, y ) with a radius of one centimetre. Draw two lines which pass through the origin and touch the circle at the upper and lower circumferences. Read the scale of vertical axis for the intersection of two lines and the quarter circle. These two `values give the confidence limits for percent defectives.

'

Intercept b=

Therefore, a = exp{-(--13'5)/

1.96 and l-96)

10. BINOMIAL

PROBABILITY

10.1 For the construction of binomial probability paper, the horizontal and vertical axis are scaled so that from origin the point 1 cm is scaled by la, 2 cm by 2a . . . . . . . . . , d cm by da, and so on. The points on the plane are denoted by ( x, y ), where x is the scale of the horizontal axis ( abscissa ), and _y that of the vertical axis. x and y denote the number of non-defectives and defectives, respectively in a sample of size n ( x + y = n ). A quarter circle with centre at origin and radius of 100 (x=y= lOO)isdrawn(seeFig.5). 10.2 Procedure Defective and sample from an the number of defectives (y). binomial paper for E&nation of Percent Confidepce Interval - Take a unknown population and count non-defectives (x) and that of Plot the point ( x, y ) on the ( seeFig. 6 ). Connect the point 10

t >

0

1

L

9

16

25 XY

36

L9

6L

81

100

.
FIG. 6

BINOMIAL PROBABILITY PAPER

10.3 Example - When the sample size is 30 and the number of defectives in the sample is 9, the point estimate and the confidence limits of fraction defective are obtained as 30 percent and 15-47 percent, respectively ( see Fig. 6 ). B
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